Improvements in Protein Function Prediction Using Confidence in Protein Interactions
نویسندگان
چکیده
Characterizing protein function is a crucial part of understanding biological systems. Here we improve protein function prediction by accounting for data quality issues inherent in protein-protein interaction (PPI) databases. To accomplish this, we incorporate confidence information into the function prediction pipeline. The model pipeline uses weighted majority voting on the proteinprotein interaction network, with weights defined by shortest paths distance, confidence, diffusion state distance (DSD), or DSD with confidence. The result is that incorporating confidence weights in general significantly helps improve protein function prediction. Confidence with DSD performs especially well, improving by 11.9 pp over Majority Vote with ordinary shortest paths distance and no confi-
منابع مشابه
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملDiscovering Domains Mediating Protein Interactions
Background: Protein-protein interactions do not provide any direct information regarding the domains within the proteins that mediate the interactions. The majority of proteins are multi domain proteins and the interaction between them is often defined by the pairs of their domains. Most of the former studies focus only on interacting domain pairs. However they do not consider the in...
متن کاملStudy of PKA binding sites in cAMP-signaling pathway using structural protein-protein interaction networks
Backgroud: Protein-protein interaction, plays a key role in signal transduction in signaling pathways. Different approaches are used for prediction of these interactions including experimental and computational approaches. In conventional node-edge protein-protein interaction networks, we can only see which proteins interact but ‘structural networks’ show us how these proteins inter...
متن کاملP-30: The Effect of The T26248G Polymorphism on Putative MethyltransferaseNsun7 Protein Function and Its Role in Male Infertility
Background: Male infertility has many causes, including genetic infertility. The NOP2/Sun domain family, member7 (Nsun7) gene, which encodes putative methyltransferase Nsun7, has a role in sperm motility. The aim of the present study was to investigate the effect of the T26248G polymorphism on Nsun7 protein function and its role in male infertility. Materials and Methods: Semen samples were col...
متن کامل